Automated Programmable Code Transformation For Portable Performance Tuning

نویسنده

  • Qing Yi
چکیده

We present a framework which uses POET, an interpreted code transformation language, to effectively combine programmable control from developers, advanced optimizations by compilers, and flexible empirical tuning of optimizations to achieve portable high performance for scientific computing. We have extended ROSE, a C/C++/Fortran source-to-source compiler, to automatically analyze scientific computing benchmarks for memory performance optimizations. Instead of directly generating optimized code, our ROSE optimizer produces parameterized POET scripts as output. The auto-generated POET optimization script is then ported to different machines for portable performance tuning. Our results show that this approach is highly effective, and the code optimized by the auto-generated POET scripts can significantly outperform those optimized using the ROSE compiler alone.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Self-tuning Zone PID Controller for Temperature Control via a PLC code

S7-1200 with Tia Portal technology has become a Standard function of distributed controlsystems. Self-Tuning methods belong to Programmable Controllers (PLC) techniques. PLCtechniques contain software packages for advanced control based on mathematical methods. S7-1200 tools are designed to increase the Process Capacity, yield, and quality of products. Most ofthe present time digital industry r...

متن کامل

The DKU Pattern for Performance Portable Parallel Programming

The shift to an ever increasing number of cores on a chip is driving the need for parallel programming methods that allow “single source, multiple hardware, high performance on each”. The difficulty of designing such methods is exemplified in the embedded industry. Here, the tradition is hand-coding for ultra-high performance on specialized architectures, so the single source must be automatica...

متن کامل

Methodology and Toolset for Model Verification, Hardware/Software co‐simulation, Performance Optimisation and Customisable Source‐code generation

The MODUS project aims to provide a pragmatic and viable solution that will allow SMEs to substantially improve their positioning in the embedded-systems development market. The MODUS tool will provide a model verification and Hardware/Software co‐simulation tool (TRIAL) and a performance optimisation and customisable source‐code generation tool (TUNE). The concept is depicted in automated mode...

متن کامل

On Using Programmable Delay Tuning Elements To Improve Performance, Reliability, and Testing of Digital ICs

The number of speed-limiting paths in modern digital integrated circuits (ICs) is in the range of millions. Due to un-modelled electrical effects and process variations in advanced fabrication technologies, it is difficult for pre-silicon timing analysis tools to provide accurate delay estimates. Hence, programmable delay elements are commonly inserted in high-performance circuits in order to p...

متن کامل

Avenue Argonne , Illinois 60439 SPAPT : Search Problems in Automatic Performance Tuning

Automatic performance tuning of computationally intensive kernels in scientific applications is a promising approach to achieving good performance on different computing architectures while preserving the kernel implementation’s readability and portability. A major bottleneck in automatic performance tuning is the computation time required to test the large number of possible code variants, whi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010